Error Analysis Using Decision Trees in Spontaneous Presentation Speech Recognition
نویسندگان
چکیده
This paper proposes the use of decision trees for analyzing errors in spontaneous presentation speech recognition. The trees are designed to predict whether a word or a phoneme can be correctly recognized or not, using word or phoneme attributes as inputs. The trees are constructed using training “cases” by choosing questions about attributes step by step according to the gain ratio criterion. The errors in recognizing spontaneous presentations given by 10 male speakers were analyzed, and the explanation capability of attributes for the recognition errors was quantitatively evaluated. A restricted set of attributes closely related to the recognition errors was identified for both words and phonemes.
منابع مشابه
A Contribution of Intrinsic Speech Variabilities to Errors Done by Speech Recognition
A usual way of ASR accuracy evaluation is calculation of Word Error Rate (WER) and Sentence Error Rate (SER). The misrecognitions that contribute to WER are classified into three categories: deletions, insertions and substitutions. The paper presents a study about a contribution of intrinsic speech variabilities to the each of the error category. Decision tree (DT) analysis is used. Five DT sty...
متن کاملMulti-accent Chinese speech recognition
Multiple accents are often present in spontaneous Chinese Mandarin speech as most Chinese have learned Mandarin as a second language. We propose a method to handle multiple accents as well as standard speech in a speaker-independent system by merging auxiliary accent decision trees with standard trees and reconstruct the acoustic model. In our proposed method, tree structures and shape are modi...
متن کاملMulti-level decision trees for static and dynamic pronunciation models
We have been focusing on improving pronunciation models for automatic transcription of television and radio news reports by modeling phone, syllable, and word pronunciation distributions with decision trees. These models were employed in two separate sets of experiments. First, decision trees facilitated selection of word pronunciations derived automatically from data for use in a standard spee...
متن کاملClustering beyond phoneme contexts for speech recognition
The clustering of using decision trees is generalized to take into account high-level knowledge sources to better model the co-articulation e ects in large vocabulary continuous speech recognition. VQ models are used to reduce the computational cost in constructing decision trees. The search algorithm is designed such that it can provide a general type of information for decision trees without ...
متن کاملContext-dependent hybrid HME/HMM speech recognition using polyphone clustering decision trees
This paper presents a context-dependent hybrid connectionist speech recognition system that uses a set of generalized hierarchical mixtures of experts (HME) to estimate context-dependent posterior acoustic class probabilities. The connectionist part of the system is organized in a modular fashion, allowing the distributed training of such a system on regular workstations. Context classes are ba...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017